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^— H Abstract 

Given an undirected graph G = (V,E) with edge capacities c e > 1 for e G E and a subset 
^ T of k vertices called terminals, we say that a graph H is a quality-g cut sparsifier for G iff 

Q H T C and for any partition (A, 5) of T, the values of the minimum cuts separating A and 

B in graphs G and H are within a factor q from each other. We say that H is a quality-g flow 
sparsifier for G iff T C V(i/), and for any set 1} of demands over the terminals, the values of 
the minimum edge congestion incurred by fractionally routing the demands in D in graphs G 
and H are within a factor q from each other. 

So far vertex sparsifiers have been studied in a restricted setting where the sparsifier H is 
not allowed to contain any non-terminal vertices, that is V(H) = T. For this setting, efficient 
algorithms are known for constructing quality-0(log kj log log k) cut and flow vertex sparsifiers, 
c/} as well as a lower bound of (l(\/\og k) on the quality of any flow or cut sparsifier. 

We study flow and cut sparsifiers in the more general setting where Steiner vertices are 
allowed, that is, we no longer require that V(H) — T. We show algorithms to construct 
t-H constant-quality cut sparsifiers of size 0(C 3 ) in time poly(n) • 2 C , and const ant- quality flow 

sparsifiers of size c°^ oglosC ^ i n time n°( logC ) • 2 C , where C is the total capacity of the edges 
incident on the terminals. 

00 

1 Introduction 

o 

Suppose we are given an undirected graph G = (V, E) with edge capacities c e > 1 for e G E, and 
a subset T Q V of k vertices called terminals. Assume further that we are interested in routing 
traffic across G between the terminals in T. While the size of the graph G may be very large, 
the specific structure of G is largely irrelevant to our task, except where it affects our ability to 
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route flow between the terminals. A natural question is whether we can build a smaller graph 



H — (V',E r ), with T Q V, that approximately preserves the routing properties of graph G with 
respect to T. In this case, we say that H is a vertex sparsifier for G. Two types of vertex sparsifiers 
have been studied so far: cut sparsifiers, which preserve the minimum cuts between any partition 
of the terminals, and flow sparsifiers, which preserve the minimum edge congestion required for 
routing any set D of demands over T. 

More formally, given any graph G with capacities c e > 1 for the edges e E E, a subset T of 
vertices called terminals, and a partition (7a, 7b) of the terminals, let MinCutc(7A, Tb) denote 
the capacity of the minimum cut separating the vertices of Ta from the vertices of Tb in G. We 
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say that a graph H — (V\ E r ) is a quality-g vertex cut sparsifler, or just cut sparsifler, for graph 
G with terminal set T, iff T C V' and for every partition (7a, 7b) of T, MinCutG<(74, 7b) < 
MinCut^ (7^, T B )<q- MinCut G (7^, 7b). 

Given a graph G with capacities c e > 1 for every edge e E £, and a subset T C 1/ of vertices 
called terminals, a set 1} of demands over the terminals specifies, for every unordered pair (£, £') of 
terminals, a demand D(t, t'). A flow F is a routing of the set of demands, iff for every pair (£, £') of 
terminals, £ and t! send £ 7 ) flow units to each other. The congestion of F is the maximum, over 
all edges e G E, of F(e)/c e , where F(e) is the flow sent along e. Given a set D of demands over the 
set T of terminals, let rj(G, D) denote the minimum congestion required for routing the demands 
in D in graph G. We say that a graph H is a flow sparsifier of quality q for G, iff T C V^iJ), and 
for any set of demands over the set T of terminals, D) < 7?(G, D) < q • 77 (i?, D). 

For a vertex sparsifler H, we say that the vertices in V(H) \ T are Steiner vertices. Vertex cut 
sparsiflers were first introduced by Moitra [Moi09], and later Leighton and Moitra |LM10| defined 
flow sparsiflers and showed that they generalize cut sparsiflers. The main motivation in both papers 
was designing improved approximation algorithms for graph partitioning and routing problems. 
Specifically, if the solution value of some combinatorial optimization problem only depends on the 
values of the minimum cuts separating terminal subsets, then given any approximation algorithm 
for the problem, we can first compute a cut sparsifler H for graph G, and then run this algorithm on 
i7, thus obtaining an algorithm whose performance guarantee is independent of the size of G, and 
only depends on the size of the sparsifler H. Flow sparsiflers can be similarly used for combinatorial 
optimization problems whose solution value only depends on the congestion required for routing 
various demand sets over the terminals in G. The definitions of the cut and the flow sparsiflers of 
|Moi09[ ILMlOj however required that the sparsifler H does not contain any Steiner vertices, that 
is,V(H) = T. 

Moitra [Moi09j showed that there exist cut sparsiflers of quality 0(log/c/loglog k) even when 
no Steiner vertices are allowed, and Leighton and Moitra [LM10] proved the existence of quality- 
0(logfc/loglogfc) flow sparsiflers for the same setting, and obtained an efficient algorithm to con- 
struct quality-0(log 2 fc/loglog/c) flow and cut sparsiflers. Recently, Charikar et al. |CLLM10] , 
Englert et al. |EGK + 10) and Makarychev and Makarychev [MM10J have shown efficient algorithms 
to construct quality-0(logfc/loglog/c) flow and cut sparsiflers that do not contain Steiner vertices. 
On the negative side, Leighton and Moitra |LM10j have shown a lower bound of fi(loglogfc) on the 
quality of flow sparsiflers when no Steiner vertices are allowed. This bound was later improved to 
Q ( \/log k I log log k) by Makarychev and Makarychev |MM10j . Englert et al. |EGK + 10| have shown 
a lower bound of ( \/Iog k / log log k) on the quality of flow sparsiflers when no Steiner vertices are 
allowed, and all edge capacities of the sparsifler are bounded from below by a constant. As for 
cut vertex sparsiflers with no Steiner nodes, [CLLMlOj and [MM10] have shown a lower bound of 
ft(log 1/4 /c), and the results of |MM10j together with the results of [FJS88] give a lower bound of 
Q ( y/log k I log log k) on their quality. 

It is therefore natural to ask whether we can obtain better quality vertex sparsiflers by allowing 
Steiner vertices. In particular, an interesting question is: what is the smallest size 5(fe), such that 
for any graph G with a set T oik terminals, there is a constant-quality cut or flow sparsifler of size 
at most S(k). Notice that if our goal is to obtain better and faster approximation algorithms via 
graph sparsiflers, then the presence of Steiner nodes may actually lead to improved performance if 
we can construct better quality sparsiflers, while keeping the graph size sufficiently low. 

For simplicity, we first consider a special case where all edge capacities in the input graph G 
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are unit, and every terminal in T has degree 1 in G. In this case we show that there exist constant- 
quality cut sparsifiers of size 0(fc 3 ), and constant-quality flow sparsiflers of size 

fc OGo g iogfc) a we also 

show algorithms to construct these sparsiflers in time poly(n) • 2 k for cut sparsiflers and in time 
n O(\ogk) . 2 k f or now sparsiflers. We then generalize these algorithms to arbitrary edge capacities. 
Let C be the total capacity of the edges incident on the terminals, assuming that for each edge 
e E E, c e > 1. We show that there exist const ant- quality cut sparsiflers of size 0(C 3 ), and constant- 
quality flow sparsiflers of size C°^ og ^ ogC \ and show algorithms to construct such sparsiflers, with 
running time poly(n) • 2 C for cut sparsiflers and n°^ ogC ^ • 2 C for flow sparsifiers. 

We say that a graph H is a restricted sparsifier for graph G, if H is a sparsifler that is associated 
with a collection C of disjoint subsets of non-terminal vertices, and H is obtained from G by 
contracting every cluster S E C into a vertex. All sparsiflers that we construct are restricted 
sparsiflers. Interestingly, Charikar et al. [CLLM10] showed that when Steiner vertices are not 
allowed, the ratio of the quality of the best possible restricted flow sparsifler to the quality of 
an optimal flow sparsifler is super-constant. Moreover, Englert et al. |EGK + 10j have shown an 
ft(y/\og k) lower bound on the quality of sparsiflers that do not contain Steiner vertices, and can 
be obtained from convex combinations of O-extensions in graph G. 

We note that our techniques are very different from the techniques of |MM101ICLLM10}lEGK + 10j . 
who exploited the connection between vertex sparsiflers and O-extensions. Instead, we use well- 
linked decompositions and other techniques that are often employed in the context of graph routing. 

Our Results and Techniques We start with a simple construction of cut sparsiflers with Steiner 
vertices, which is summarized in the following theorem. 

Theorem 1 Let G — (V, E) be any n-vertex graph with capacities c e > 1 on edges e E E, and a 
setT^V of terminals. Let C denote the total capacity of the edges incident on the terminals, and 
let < e < 1 be any constant. Then there is a quality-(3 + e) vertex cut sparsifier H = (V f , E r ) for 
G, with \V'\ — 0(C 3 ). Moreover, graph H can be constructed in time poly(n) • 2 C . 

For simplicity, we give an outline of the construction for the special case where all edge capacities 
are unit, and the degree of every terminal is 1. Our algorithm relies on the notion of well-linkedness, 
and on a new procedure to compute a well-linked decomposition. Given any subset S of vertices, 
let out (S) denote the set of edges with exactly one endpoint in S. We say that S is a- well-linked, 
iff for any partition (A, B) of £, if we denote Ta = out (S) flout (A) and Tb = out (5) flout (5), then 
\E(A, B)\ > a • min {|7a| 5 |7b|}- Informally, we can set up an instance of the sparsest cut problem, 
on graph G[S] U out (5), where the edges of out (S) serve as terminals. Set S being a- well-linked 
is roughly equivalent to the value of sparsest cut in this new graph being at least a. The notion 
of well-linkednes^] has been used extensively in graph routing e.g. in [Ra c02t ICKS044 ICKS051 
RZ10J lAndlOj . and one of the useful tools for designing algorithms for routing problems is well- 
linked decomposition: a procedure that, given any subset S of vertices with | out^)! = z, produces 
a partition W of S into well-linked subsets. In all standard well-linked decompositions, we can 
ensure that \W\ is small (less than z), while each set X E W is guaranteed to be a- well-linked, 
where a = 1/ poly log z. We show a different well-linked decomposition, that instead ensures that 
every set X E W is 1/3- well-linked, and we can still bound the number of clusters in W by 0(z 3 ). 
An algorithm for constructing a cut sparsifler then simply computes a well-linked decomposition 

1 Our definition of well-linkedness is very similar to what was called bandwidth property in Rac02 ], and cut 
well-linkedness in [CKS05 , where we use the graph G[S] U out(S'), and the set of terminals is the edges of out(S'). 
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W of the set V(G) \ T of vertices, and contracts every cluster X E W. Since every cluster IeW 
is 1/3- well-linked, it is easy to verify that we obtain a constant-quality cut sparsifier. 

We now turn to the more challenging task of constructing flow sparsiflers. We again first 
consider a special case where all edge capacities are unit, and each terminal t E T has exactly one 
edge incident to it in G. We show that for this special case, there is a flow sparsifier H of quality 
68 and size fc o ( lo g lo g*0 ; where k = |T|. Recall that a sparsifier H is called a restricted sparsifier 
iff it is associated with a collection C of disjoint subsets of non-terminal vertices, and graph H is 
obtained from G by contracting each cluster S E C into a vertex. 

Theorem 2 Let G = (V,E) be any n-vertex (multi-) graph with unit edge capacities and a set 
Tcy of k terminals. Assume further that each terminal in T has exactly one edge incident to it 
in G. Then there is an algorithm that finds, in time 

n O(logfc) . 2 * a 

quality-q restricted vertex flow 

sparsifier H for G, with \V{H)\ = k°^ oglogk ^ and q = 68. 

It is then fairly easy to obtain the following corollary that extends the results of Theorem [2] to 
general graphs. 

Corollary 1 Let G = (V, E) be any n-vertex graph with edge capacities c e > 1 for e E E, and a set 
T cy of terminals. Let C denote the total capacity of all edges incident on the terminals, and let 

< e < 1 be any constant. Then there is an algorithm that finds, in time n 0( ^ ogC ^ • 2 C , a quality-q 
vertex flow sparsifier H for G, with \V(H)\ = C°^ oglogC ^ and q = 68 + e. 

We now outline our algorithm for constructing flow sparsiflers for the special case where all 
edge capacities are unit, and the degree of every terminal is 1. Let us assume for simplicity that 
the set R = V{G) \ T of vertices is 1/3- well-linked (we perform a well-linked decomposition as a 
pre-processing step to ensure this). One of the central notions in our algorithm is that of good 
routers. We say that a subset S C R of vertices is a good router iff it is 1/3- well-linked, and 
moreover, every pair of edges in out (S) can simultaneously send 1/z flow units to each other with 
constant congestion inside S, where z = | out (5)1. We say that a graph H is a legal contracted 
graph for G iff there is a collection C of disjoint good routers in graph G, and H is obtained from 
G by contracting every cluster S E C. It is easy to verify that if H is a legal contracted graph, then 
it is a constant quality flow sparsifier, since contracting the good routers in C may only affect the 
congestion of any routing by a constant factor. Our goal is then to find a legal contracted graph 
whose size is small enough. 

Notice that we have assumed that R = V{G) \ T is 1/3- well-linked. However, this is not 
sufficient to ensure that R is a good router, as the ratio between the minimum sparsest cut and the 
maximum concurrent flow, known as the flow-cut gap, can be as large as logarithmic in undirected 
graphs. To overcome this difficulty, we define several special structures that we call witnesses. If 
graph G contains such a witness, then we are guaranteed that R is a good router. For example, 
suppose that for some value < a < 1, graph G contains r = \ogk/a disjoint subsets Si, . . . , S r 
of non-terminal vertices, where for each 1 < j < r, subset Sj is a-well-linked, and there is a set 
Vj of edge-disjoint paths in G, connecting every terminal in T to some edge in out(Sj). For each 

1 < j < let Ej C out(Sj) be the set of k edges where the paths of Vj terminate. Since the 
flow-cut gap in undirected graphs is bounded by O(logfc), and the set Sj is a- well-linked, every 
pair of edges in Ej can simultaneously send k ^ gk = ^ flow units to each other with constant 
congestion inside Sj. If graph G contains such a witness {Si, . . . , S r }, it is easy to verify that R 
must be a good router, since we can send, for each 1 < j < r, 1/r flow units along each path in Vj, 
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so that each edge e E out (Sj) receives at most 1/r flow units, and then send ^ flow units between 
every pair of edges in Ej, with constant congestion inside Sj. In this way, every pair of terminals 
sends 1/k flow units to each other with constant congestion in G. 

Our algorithm proceeds as follows. Throughout the algorithm, we maintain a legal contracted 
graph G' of G, where at the beginning G' — G. As long as the number of vertices in G' is large 
enough, we perform an iteration, whose output is either a witness for set R being a good router, 
or another legal contracted graph G" that contains fewer vertices than G \ In the former case, we 
stop the algorithm and output a sparsifier H obtained from G by contracting the set R, and in the 
latter case we proceed to the next iteration. In fact, we can efficiently check whether R is a good 
router beforehand, by computing an appropriate multicommodity flow in G[R] : to ensure that the 
former case never happens. Once the size of the current graph G' becomes small enough, we output 
it as our final sparsifier. 

Organization: We start with preliminaries and notation in Section [2| We construct cut 
sparsiflers in Section [3] and flow sparsiflers in Section |4j 

2 Preliminaries and Notation 

In all our results, we start with a special case where all edges in G have unit capacities, and then 
extend our results to the general setting. Therefore, all definitions and results presented in this 
section are for graphs with unit edge capacities. 

General Notation For a graph G = (V, E), and subsets V' C V, E r C E of its vertices and edges 
respectively, we denote by G[V^], G\V, and G\E r the sub-graphs of G induced by V\ V \V\ and 
E\E f , respectively. For any subset S C V of vertices, we denote by outc(S') = Eg(S, V \ S) the 
subset of edges with one endpoint in S and the other endpoint in V \ S. When clear from context, 
we omit the subscript G. All logarithms are to the base of 2. 

Let V be any collection of paths in graph G. We say that paths in V cause congestion 77 in G, 
iff for each edge e E E(G), the number of paths in V containing e is at most 77. 

Given a graph G = (V, E), and a subset T C V of vertices called terminals, a set D of demands 
is a function D : T xT — > R + , that specifies, for each pair t,t f E T of terminals, a demand D(t, t'). 
For simplicity, we assume that the pairs £, £' of terminals are unordered, that is D(t,t') — D{t f ,t) 
for all t, t' E T. We say that the set D of demands is ^-restricted, iff for each terminal t E T, the 
total demand J2t'eT D{t,t') < 7. 

Given any set D of demands, a routing of D is a flow F, where for each unordered pair t, t r E T, 
the amount of flow sent from t to t r (or from t' to t) is D(t,t f ). The congestion of the flow is the 
maximum, over all edges e E E, of F(e) — the amount of flow sent via the edge e. 

Given any two subsets V\,V2 of vertices, we denote by F : V\ V2 a flow that causes 
congestion at most 77 in G, where each vertex in V\ sends one flow unit, and each flow-path starts 
at a vertex of V\ and terminates at a vertex of V2. We denote by F : V\ ^ v V2 a flow with the 
above properties, where additionally each vertex in V2 receives at most one flow unit. Similarly, 
we denote by V : V\ ^ v V2 a collection of paths V — {P v \ v E V{\ in graph G, where each path 
P v originates at v and terminates at some vertex of V2, and the paths in V cause congestion at 
most 77. We denote V : V\ V2 if additionally each vertex of V2 serves as an endpoint of at most 
one path in V . Similarly, we define flows and paths between subsets of edges. For example, given 
two collections Ei, E2 of edges of G, we denote by F : E\ ^ v E2 a flow that causes congestion at 
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most 77 in G, where each flow-path has an edge in E\ as its first edge, and an edge in E2 as its last 
edge, and moreover each edge in E\ sends one flow unit. (Notice that it is then guaranteed that 
each edge in E2 receives at most 77 flow units due to the bound on congestion). If additionally each 
edge in E2 receives at most one flow unit, we denote this by F : E\ E2. Collections of paths 
connecting subsets of edges to each other are defined similarly. We will often be interested in a 
scenario where we are given a subset S C V(G) of vertices, and E\^E2 Q out(S). In this case, we 
say that a flow F : E\ ^ E2 is contained in 5, iff for each flow-path P in F, all edges of P belong 
to G[S], except for the first and the last edges that belong to out (S). Similarly, we say that a set 
V : Ei ^ v E2 of paths is contained in £, iff all inner edges on paths in V belong to G[S}. 

Sparsest Cut and the Flow-Cut Gap Suppose we are given a graph G = (V,E), with non- 
negative weights w v on vertices v E V, and a subset T C V of k terminals, such that for all v T, 
w v = 0. Given any partition (A,B) of V, the sparsity of the cut (A,B) is m i n {^(^)^|l( j g)| ? where 
= J2veA w v an d W(B) = J2veB w v- I n the sparsest cut problem, the input is a graph G 
with non- negative weights on vertices, and the goal is to find a cut of minimum sparsity. Arora, 
Rao and Vazirani [ARV09] have shown an 0(y/\og /^-approximation algorithm for the sparsest cut 
problem. We denote by v4arv this algorithm and by a ARV (k) = 0(ylog k) its approximation factor. 
We will usually work with a special case of the sparsest cut problem, where for each t E T, w t = 1. 
We denote such an instance by (G, T). 

The dual of the sparsest cut problem is the maximum concurrent flow problem, where the goal 
is to find the maximum possible value A, such that each pair (£, t f ) of terminals can simultaneously 
send X/k flow units to each other with unit congestion (we assume that in the sparsest cut problem 
instance the weights wt = 1 for all t E T). The flow-cut gap is the maximum possible ratio, in any 
graph, between the value of the minimum sparsest cut and the value A of the maximum concurrent 
flow. The flow-cut gap in undirected graphs, that we denote by /3fcg(^) throughout the paper, is 
6(log k) |LR991 IGVY951 ILLR941 IAR98J . In particular, if the value of the sparsest cut in graph G 
is a, then every pair of terminals can send at least j^^tja now units to each other simultaneously 
with no congestion. It is also easy to see that any 1-restricted set D of demands on set T of 
terminals can be routed with congestion at most 2/3fcg(^)/<^- I n order to find this routing, let F 
be the flow where every pair of terminals sends ^p F ^ G ^ now units to each other with no congestion, 
and let F f be the same flow scaled up by factor /3fcg(^)/<^ 5 so the flow in F r causes congestion at 
most /3fcg(^)/^ 5 an d every pair of terminals sends 1/k flow units to each other. For each pair (£, t f ) 
of terminals, vertex t sends D(t, t')/k flow units to each terminal in T using the flow F r (scaled by 
factor D(t,t f )), and vertex t' collects D(t,t f )/k flow units from each terminal in T. It is easy to 
verify that, since the set D of demands is 1-restricted, the total congestion of this flow is bounded 
by 2f3 FCG (k)/a. 



Well-Linked Decompositions 



Definition 1 Given a graph G, a subset S of its vertices, and a parameter a > 0, we say that S 
is a-well-linked, iff for any partition (A,B) of S, if we denote by Ta — out (A) fl out (5), and by 
T B = out(S) nout(S), then \E(A,B)\ > a • min{|T A |, \T B \}. 

Given a subset S of vertices of G, we define a graph Gs associated with 5, and a corresponding 
instance {Gs^Tg) of the sparsest cut problem, that we use throughout the paper. We start by 
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sub- dividing every edge e E outc(5) by a vertex t e , and let Tg — {t e \ e ^ outc<(5)} be the set of 
these new vertices. We then let Gs be the sub-graph of the resulting graph, induced by 5 U T' s . 
Notice that set 5 is a-well-linked in G iff the value of the sparsest cut in instance (Gs,Tg) is at 
least a (for a < 1). In particular, if 5 is a-well-linked, and | out (5)1 = z, then we have the following 
two properties: 

PI. Any set D of 1-restricted demands on the edges of out (5) can be routed inside 5 with 
congestion at most 2/3fcg(^)/<^- 

P2. For any two subsets £4, E2 Q out (5), where |£i| = l^l, there is a collection V : £"i ~*|"i/ a ] E2 
of paths contained in 5. 

In order to obtain the latter property, we set up a single-source single-sink max-flow instance 
in graph Gs, where the edges of E\ serve as the source and the edges of E2 serve as the sink. The 
existence of the flow F : E\ ~**\/ a E2 follows from the max-flow/min-cut theorem, and the existence 

of the set V : E\ ^fi/ a i E2 of paths follows from the integrality of flow. 

A well-linked decomposition of an arbitrary subset 5 of vertices, is a partition of 5 into a 
collection of well-linked subsets. We use two different types of well-linked decomposition, that give 
slightly different guarantees. We start with a standard decomposition, that we refer to as the weak 
well-linked decomposition, and it is similar to the one used in [CKS05; Rac02j. The proof of the 
next theorem appears in the Appendix. 

Theorem 3 (Weak well-linked decomposition) Given any graph G — (V,E), and any subset 
S C V of vertices with | out (5)1 = z, there is an efficient algorithm, that finds a partition W 
of S, such that for each set R E W, |out(i?)| < |out(5)|, R is aw (z) -well-linked for qlw(z) — 

n (to^) ' and ^tew I out (^) I < L2 I out ( 5 ) I • 

The next theorem gives what we call a strong well-linked decomposition. This decomposition 
gives a better guarantee for the well-linkedness of the resulting sets in the partition. The drawback 
is that the running time of the algorithm is exponential in |out(5)|, and the number of edges 
adjacent to the subsets in the partition is higher. The proof of the next theorem appears in the 
Appendix. 

Theorem 4 (Strong well-linked decomposition) Given any n-vertex graph G — (V,E), and 
any subset 5 C V of its vertices, where G[S] is connected and | out(5)| = z, there is an algorithm 
running in time 2 Z • poly(n), that finds a partition S of S, such that: 

• For each R E S, | out(i?)| < | out(5)|, and R is 1/ ^-well-linked; 

• YjReS I °ut(i?)| = 0(z 3 ); and 

• For each i : 1 < i < 2 [log z\ , if Si C S denotes the collection of subsets R E S with 
z/2 i < I out(R)\ < z/2 i ~ 1 , then \Si\ < 2 3i+3 for all i. 

We will sometimes use the notion of well-linkedness in a slightly different setting. Suppose we 
are given a graph G — (V, E), and a subset T Q V of vertices called terminals. We say that G is 
a-well-linked with respect to T, iff for any partition (A, B) of V, if we denote Ta = T fl A and 
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Tb — Tfl B, then \E(A,B)\ > a • min{|T^|, |Tb|}. A convenient way of viewing this consistently 
with the previous definition of well-linkedness is to augment the graph G, by adding an edge 
connecting each terminal t E T to a new vertex vt- Saying that G is a- well-linked for T is then 
equivalent to saying that the subset V of vertices of the new graph is a- well-linked. 

3 Cut Sparsifiers 

In this section we prove Theorem [Tj We fist consider a simpler special case where all edge capacities 
are unit (but parallel edges are allowed), in the following theorem. 

Theorem 5 Let G — (V,E) be any n-vertex (multi-) graph with unit edge capacities, and a set 
T C V of terminals. Let k = J2teT be the sum of degrees of all terminals. Then there is a 
quality-3 vertex cut sparsifier H — (V f ,E f ) for G, with \V f \ — 0(k 3 ). Moreover, graph H can be 
constructed in time poly(n) • 2 k . 

Proof: We assume w.l.o.g. that G is a connected graph: otherwise, we construct a sparsifier 
for each of its connected components separately. Let S = V \ T- Notice that | out (5)1 = k. In 
order to construct the sparsifier H, we compute a strong well-linked decomposition S of S, given 
by Theorem [ZJ Recall that the decomposition can be found in time 2 k • poly(n), and |<S| = 0(k 3 ). 
We now contract each set R E S into a single super- node vr. The resulting graph is the sparsifier 
H. Notice that H is an unweighted multi-graph, and \V(H) \ = 0(fc 3 ). 

Assume that we are given any partition (7a, 7b) of the set T of terminals. It is easy to see 
that MinCutc(7A, Tb) < MinCutij (7a, Tb)' let (X f ,Y f ) be the minimum cut, separating 7a from 
Tb in graph H. Since H is obtained from G by contracting some subsets of its vertices, the cut 
(X',Y f ) naturally induces a cut (X,Y) separating 7a from Tb in G: for each cluster R E <S, if 
vr E X\ then we add all vertices of R to X, and otherwise we add them to Y. The value of the 
cut, \E G (X,Y)\ = \E H (X',Y% and so MinCut G (7^, T B ) < MinCut#(7^ 7b). 

We now prove that MinCut/f (7a, Tb) < 3MinCutG(7A, 7b) • Let (X,Y) be the minimum 
cut separating 7a from 7b in G. We define a cut (X\Y f ), separating 7a from 7b in H, with 
\Eh (X', Y f )\ < 3\Eg(X, y)|, as follows, we start with the cut (X, Y) in graph G, and we gradually 
change this cut, so that eventually, for each set R E 5, all vertices of R are completely contained 
in either X or in Y . The resulting partition will then naturally define the cut (X 7 , Y') in graph H. 

We process the sets R E S one- by-one. Let R be any such set. Partition the edges of out (it!) 
into four subsets: Ex^Ey ,Exy ,Eyx, as follows. Let e = (u,v) E out(i?), where u E i?, v R. If 
both u and v belong to X, then e is added to Ex- If both vertices belong to Y, then e is added to 
Ey. If u belongs to X and v to Y, then e is added to Exy- Otherwise, it is added to Eyx (see 
Figure [TJ. Let E f R = E G (RnX,RnY). If \E X \ + \E X y\ < \Ey\ + \E Y x\, then we move all vertices 
of R to Y] otherwise we move them to X. 

Assume w.l.o.g. that \E X \ + \E X y\ < \Ey\ + \Eyx\, and so we have moved the vertices of 
R to Y. The only new edges that we have added to the cut are the edges of Ex- On the other 
hand, the edges of E' R , that belonged to the cut before the current iteration, do not belong to 
the cut anymore. We charge the edges of E' R for the edges of E x . Since set R is 1/3- well-linked, 
\E' R \ > \E x \/3 must hold, and so the charge to each edge of E R is at most 3. Moreover, since 
the edges of E' R are the inner edges of the set R (that is, both endpoints of each such edge belong 
to i?), we will never charge these edges again. Therefore, if (X,Y) denotes the final cut, after all 
clusters R E S have been processed, then \Eq(X,Y)\ < 3\Eq(X,Y)\. Finally, the cut (X,Y) in 
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Figure 1: Illustration for Theorem [5] 



graph G naturally defines a cut {X 1 \Y f ) in graph H: for each cluster R E 5, if R C X, then we 
add ^ to X 7 ; otherwise we add it to Y' . Clearly, Y')\ = \E G (X,Y)\ < 3\E G (X,Y)\. We 

conclude that MinCut#(74, 7b) < 3 MinCutc(7A, 7b). □ 

We now complete the proof of Theorem [lj Suppose we are given a graph G with arbitrary 
edge capacities c e > 1. For notational convenience, we denote the input parameter e by e', and 
we set e = e'/3. We perform the following transformation in graph G. Let C be the sum of the 
capacities of all edges incident on the terminals. For each edge e E E, if the capacity c e > C, then 
we set it to be C. Notice that this does not change the values MinCutc(7A, Tb) for any partition 
(7a, 7b) of the set T of the terminals, since MinCutc(7A, 7b) < C always holds. Finally, we replace 
each edge e E E with \c e /e] parallel unit-capacity edges. Let G 1 be the resulting graph. We now 
apply Theorem [H] to graph G\ to obtain a sparsifler H' of size 0((C/e) 3 ) = 0(C 3 ). We obtain a 
sparsifler H for graph G, by setting the capacity of every edge in H f to e. We now show that H is 
a quality- (3 + 3e) = (3 + e') -sparsifler for G. 

Notice that for each partition (7a, 7b) of T, MinCut ^ r ^- r ^) < MinCut G /(7^, 7b) < MinCut^/(7^, 7b) 
MinCut^(rA,r B ) ^ and gQ MinCutG (7^ ? 7^) < MinCut if (7^,7B). On the other hand, MinCut G /(7^, 7b) < 
MmCutG(7A,T g ) ^ _|_ e ^ s i nce a n original edge capacities c e > 1, and so < ^(1 + e). Therefore, 
MinCut^(7^,7B) = e • MinCut if /(74,TB) < 3eMinCut G /(7^, Tb) < 3(1 + e) MinCut G (7^, 7b). 

4 Flow Sparsifiers 

In this section we prove Theorem [2] and Corollary [l] We start with the following definition. 

Definition 2 Let S C V \ T be any subset of non-terminal vertices, and let \ out(*S)| = z. We 
say that S is a good router iff S is 1/3-well-linked, and every pair (e,e') E out(*S) of edges can 
simultaneously send 1/z flow units to each other inside S, with congestion at most 77* = 34. 

Notice that we can efficiently check whether S is a good router by computing an appropriate 
multicommodity flow in the graph Gs- Notice also that if S is a good router, then any 1-restricted 
set D of demands on the edges of out (S) can be routed with congestion at most 277* inside S. 
Indeed, let F be the flow, where each pair (e, e') E out(S') of edges sends 1/z flow units to each 
other with congestion at most 77* inside S. In order to route the set D of demands, consider any 
pair (e,e') E out (5) of edges. Edge e sends D{e 1 e r )/z flow units to each edge e" E out (5), using 
the flow F (scaled by factor D{e 1 e')) ) while edge e r collects D(e,e')/z flow units from each edge 
e" E out (5), using the flow F. In the end, we have i?(e, e f ) flow units sent from e to e 7 , and since 
the set D of demands is 1-restricted, the total congestion of this routing is bounded by 277*. 
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Definition 3 We say that a graph G' is a legal contracted graph for G iff there is a collection C 
of disjoint good routers, where the clusters S E C do not contain any terminals, and G' is obtained 
from G by contracting every cluster S E C into a super-node vs- (We remove self-loops, but leave 
parallel edges). 

It is easy to see that if G' is a legal contracted graph for G, then it is a quality-2?7* flow sparsifier, 
as the next claim shows. 

Claim 1 // G' is a legal contracted graph for G, then it is a quality-lrf restricted flow sparsifier. 

Proof: Given any set D of demands on the terminals in T, it is immediate to see that r\ (G', D) < 
rj(G,D), since G 1 is obtained from G by contracting some vertex subsets into super-nodes. 

Assume now that we are given some set D of demands on T, and T]{G f , D) = 77. For simplicity, 
we scale the demands in D down by the factor of 77, to obtain a new set D r of demands with 
rj(G , ^D / ) = 1. It is now enough to show that we can route the demands in D' in graph G with 
congestion at most 2?7*. Let F be the routing of D' in G r with congestion 1. For each cluster 
S E C, for each pair (e, e') E out (5) of edges, let Ds{e 1 e r ) be the total amount of flow in F sent 
on flow-paths that enter vs through edge e, and leave it through edge e r . We have thus obtained 
a set Ds of 1-restricted demands on the edges of out (S). Since S is a good router, these demands 
can be routed inside S with congestion at most 277* . Let F$ denote this routing. In order to obtain 
the final routing F' of the set D' of demands in G, we start with the flow F, and we augment it 
with the routings F$ that we have computed in each cluster S E C. Therefore, f](G,D f ) < 2rf . It 
is immediate to see that G' is a restricted sparsifier for G, from the definition of a legal contracted 
graph. □ 

Most of this section is devoted to proving the following theorem, which gives a construction of 
a flow sparsifier for the special case where the set V \ T is 1/3- well-linked. 

Theorem 6 Assume that we are given any (multi-)graph G — (V, E), with unit edge capacities and 
a subset T C V of k terminals, where every vertex in T has degree 1. Let R — V\T, and assume 
further that R is 1/3- well-linked. Then there is an algorithm that finds, in time 
restricted flow sparsifier H of quality q = 2rf for (G,T), such that \V(H)\ = &°( lo g lo g*0 ; an d i s 
a legal contracted graph for G. 



We defer the proof of Theorem [6] to Section 4^, and complete the proof of Theorem [2] here 



We assume w.l.o.g that G is a connected graph: otherwise, we compute a sparsifier for each of its 
connected components separately. Our first step is to compute a strong well-linked decomposition 
S of the set V \ T of vertices, given by Theorem |ij Recall that each set X E S is 1/3- well-linked, 
|<S| = 0(/c 3 ), and the decomposition can be computed in time 2 k • poly(n). For each edge e in 
set [J Xg<s out(X), we sub-divide e by a new vertex v e , and we let G' denote the resulting graph. 
For each cluster X E 5, let Tx — {v e | e E outc(X)}, and let Gx = G'[X U Tx\- Notice that 
\Tx\ < fc, and all vertices in Tx have degree 1 in Gx- For each cluster X E 5, we use Theorem [6] 
on graph Gx and the set Tx of terminals, to find a restricted flow sparsifier Hx- Let Cx be the 
corresponding collection of disjoint subsets of V{Gx) \ Tx, such that Hx is obtained from Gx by 
contracting every cluster in Cx- Let C = {JxeS^X- We obtain our final sparsifier H by contracting 
every cluster S E C into a super-node vs- Notice that since, for each cluster X E 5, graph Hx is a 
legal contracted graph for Gx, each cluster S E C is a good router, and so H is a legal contracted 
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graph for G. From Claim [I] H is a quality- (27/*) restricted sparsifler for G. It is easy to see that 
the running time of the algorithm is 2 k • n°^ ogk \ It now only remains to bound \V(H)\. 

Recall that |5| < 0(/c 3 ), and for each X e 5, |out(X)| < |T| = k. Therefore, \V{H X )\ = 
fc O(iogio g fc) 5 and \v(H)\ = 0(k 3 ) • fcOOogiog/c) = fe O(logiogfc). Thig completes the proof of Theorem^ 

The proof of Corollary [T] follows from Theorem [2] using standard techniques, and it appears in 
Section [C] of the Appendix. We now focus on the proof of Theorem |6j which is the main technical 
contribution of this section. 

4.1 Proof of Theorem [6] 

We prove the theorem by induction on the value of k. Throughout the proof, we use two parameters: 
r = 0(log 3 fc), and fc* = 2/crlogr = k poly log k. We set the value r to be a large enough integer, 
so that the following inequality holds: 

r > 24 j 8 PC G(fe*)/a w (fe*) (1) 

Notice that P FC G(k*)/a w (k*) = 0(log 5/2 (2fcr logr)) = 0(log 3 /c) + 0(log 3 (r logr)), so r = 
0(log 3 k) is sufficient. 

Next, we define a function F : R + — >> R + , where F{k') will roughly serve as an upper bound on 
the size of the sparsifler for any graph G with k r terminals. Function F is defined recursively, as 
follows. For k! < 4, F{k') = 1. If k! > 4 is an integral power of 2, then F{k f ) = 2 16 -r 3 logr-F(fc'/2) = 
O (log 9 A; log log k ) • F{k' /2). Otherwise, F{k') — F{k"), where k" is the smallest integral power of 
2 with k" > k! . Notice that for any integer k! > 4, F{k' /2) = F(\k f /2]), and we will sometimes 
use these values interchangeably. 

Notice that for all values /c', F(fc') = (\ogk , )°^ k '\ so = /c°( lo g lo g fc ) as required. From 

now on, we focus on proving that if G = (V, E) is a graph as in the theorem statement with k 
terminals, then we can find, in time 

n O(io g fc) . 2 * a 

restricted quality- (2rj*) sparsifler H for G, such 
that \V(H) \ T\ < F(k), and H is a legal contracted graph for G. 

The proof is by induction on the values of A:. If k < 4, then the set R = V \ T is a good 
router, so we can let C — {i?}, and return the corresponding contracted graph H as our sparsifler, 
so \V(H) \ T| = 1. Assume now that the claim holds for values k' < k, and we now prove it for k. 

Notice that if the set R = V\T of vertices is a good router, then we can set C = {i?}, and output 
a sparsifler H, obtained from G, after we contract the cluster R into a super-node vr. Therefore, 
we can assume from now on that R is not a good router. The main idea of the algorithm is as 
follows. Throughout the algorithm, we maintain a collection C of disjoint good routers in graph G 
and the corresponding legal contracted graph G f . At the beginning, C = 0, and G r — G. While the 
number of vertices in V(G') \ T is greater than F(k), we perform an iteration, in which we obtain 
a new collection C of disjoint good routers, such that the corresponding graph G" contains strictly 
fewer vertices than G' . Once the number of vertices in V{G') \ T falls below F{k), we stop and 
output G f as our sparsifler. 

Notice that if G r is a legal contracted graph for G, then each edge of G r corresponds to some 
edge of G. We do not distinguish between these edges. For example, if S C V{G r ) is any subset of 
vertices, and S' C V(G) is obtained from S by replacing each super-node vq E S by the vertices of 
C, then we view outG'(S) = outc(S' / ). We need the following definition. 

Definition 4 Let G f be the current legal contracted graph, and let S C V{G f ) \T be any subset 
of non-terminal vertices, such that G f [S] is connected. We say that S is a contractible set iff 
\out(S)\ < \k/2], and \S\ > 128F(| out G /(5)|). 
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Let G' be the current contracted graph, and let C be the corresponding collection of good routers. 
Suppose we can find a contractible set S of vertices in the current graph G', with | out(5)| = k 1 '. 
We show that in this case we can compute a smaller legal contracted graph G" . We denote this 
procedure by Contract(G / , S). Procedure Contract(G / , S) is executed as follows. Let Cs Q C contain 
all clusters C with vq E 5, and let S f be the subset of vertices of the original graph G obtained from 
S by replacing each super- node vc E Cs with the vertices of C. Clearly, \S'\ > 128F(/c / ) still holds, 
G[S f ] is a connected graph, and | out<3(S")| = kl . Let S be the strong well-linked decomposition of 
S / given by Theorem [4| We now process the clusters in S one by one. Consider some cluster Z E S. 
We construct a new graph Gz from graph G, by first sub-dividing every edge e E outc(Z) by a 
vertex v e , setting Tz — {v e \ e E outc(Z)}, and we let Gz be the sub-graph of the resulting graph 
induced by Z U Tz- Let kz = \Tz\ = \ onta(Z)\^ and observe that kz < k' < \k/2] < k. Recall 
that Gz is 1/3- well-linked for Tz, so by the induction hypothesis, we can find a sparsifier Hz for 
(Gz, Tz), with \V(Hz)\Tz\ < F(kz)- Let Cz be the collection of the good routers corresponding to 
Hz- Recall that each cluster C E Cz only contains vertices of Z. Let C — (C \ Cs) U (Uzg<s ^z) be 
the new collection of good routers in graph G, and let G" be the contracted graph corresponding 
to C . Graph G n is the output of procedure Contract(G / , S). In the next claim we show that 
\V(G")\ < \V(G% 

Claim 2 Let G" be the output of Procedure Contract(G', S). Then \ V[G")\ < \V{G')\. 
Proof: From the definition of G", 

V{G") = \V{G')\-\S\ + Y J \Hz\Tz\ 

zes 

<\V(G')\-\S\ + Y,F(kz). 

zes 

Let kl = \ out g'(S)\, and let k n be the smallest power of 2, such that k n > kl . Recall that 
\S\ > 128F(/c / ) = 128F(fc 7/ ), so in order to show that \V{G")\ < \V(G% it is enough to show that 
J2zeS F ( k z) < 128F(fc // ). For each i : 1 < i < logk" + 1, let Si C S be the collection of subsets 
Z E S with k" IT <k z < k"/2 i ~ 1 . Then from Theorem^ \Si\ < 2 3i+3 for all i. Therefore, 

log^'+l 

Y, F (kz)< E |«5i| ■ F(k"/2 i ~ 1 ) 
zeS i=i 

logfc"+l 

< J] 2 3i+3 • F(k"/2 i - 1 ) 

i=l 

Let T{i) = 2 3i - 3 F(k"/2 i ~ 1 ). Then 

T(i) = 8 • 2 3i - 6 F{k"/2 i ~ 1 ) < \ ■ 2 3i - 6 F(k"/2 i ~ 2 ) = - 1). 
Therefore, values T(i) form a geometrically decreasing sequence, 

and El=i fc +1 2 3i+3 -F(k"/2 i - 1 ) < 
2 7 -T(l) = 2 7 F{k"). □ 



12 



We now proceed to define two structures, that we call a type-1 and a type-2 witnesses. We show 
that if G' is a legal contracted graph for G, and G' contains either a type-1 or a type-2 witness, 
then R — V{G) \ T must be a good router. Finally, we show an algorithm, that, given a legal 
contracted graph G' with \V(G') \T\ > F(k), either finds a contractible subset S C V(G') \ T 
of vertices in G', or returns a type-1 or a type-2 witness in G' . Since we have assumed that R 
is not a good router, whenever we apply this algorithm to the current legal contracted graph G', 
we will obtain a contractible subset S of vertices, and by using procedure Contract(G / , £), we can 
obtain a new legal contracted graph G" with |y(G")| < |y(G')|. We continue this process until 
\V(G') \ T\ < F(k) holds, and output G' as our sparsifier then. We now proceed to define the two 
types of witnesses. 

Definition 5 Let G' be a legal contracted graph, and let T = {S[, . . . , S' r } be a family of disjoint 
subsets ofV(G f ) \T. We say that J 7 is a type-1 witness, iff for each 1 < j < r, S f - is aw (k*) -well- 
linked in graph G r , and there is a collection Vj of \k/2] edge-disjoint paths in graph G' , where each 
path connects a distinct terminal in T to a distinct edge in out^S^). 

Definition 6 Let A C V(G') \ T be any subset of non-terminal vertices. We say that A is a type-2 
witness iff we are given a subset E C outc?/(A) of r • |~fc/4] edges, such that A is aw(r ■ \k/A\)-well 
linked for E, and we are given a partition E\,...,E r of E into r disjoint subsets of size |~fc/4] 
each, and a subset T* ^ T of |~fc/4] terminals, such that for each 1 < j < r, there is a collection 
Vj : T* ^2 Ej of paths in graph G' . 

(Here we say that A is a- well-linked for E C outc?/(A) iff for any partition (X,Y) of A, if we 
denote T x = ^Hout G /(X), and T Y = Enout G >(Y), then \E G *(X,Y)\ > a • min{|T x |, \T Y \}.) 

We start by showing that if a legal contracted graph G r contains a type-1 witness or a type-2 
witness, then the set R is good router. 

Theorem 7 If any legal contracted graph G' contains a type-1 witness T , or a type-2 witness A, 
then R — V{G) \ T is a good router. 

Proof: Recall that R is 1/3- well-linked. So we only need to prove that if G' contains a type-1 
or a type-2 witness, then every pair of terminals can simultaneously send 1/k flow units to each 
other with congestion at most 77*. We need the following two simple claims, whose proofs appear 
in the Appendix. 

Claim 3 Let G' be a legal contracted graph, S' C V(G f ) \ T, and E' C outQ/(S'), such that S' is 
a-well-linked for E' , for any a < 1. Let S C V{G) \ T be the set of vertices obtained from S' , after 
we replace every super-node vq E S' with the set C of vertices. Then S is a/3-well-linked for E' 
in graph G. 

Claim 4 Let G' be a legal contracted graph for G, S' C V{G') \ T any subset of non-terminal 
vertices in G' , and E' C out ( ^/(S' / ) any subset of edges, and assume further that we are given a 
subset T' C T of terminals with \T'\ — \E'\, such that there is a collection V' : T' F' of paths 
in G' . Let S C V(G) \ T be the set of vertices obtained from S' after we replace every super-node 
vc by the set C of vertices, and consider the same subset E f C outc(S') of edges. Then there is a 
set V : T f ^37; E' of paths in graph G. 
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Type-1 Witnesses Assume first that graph G' contains a type-1 witness T — {S[, . . . , S' r }. Fix 
some 1 < j < r, and consider the subset Sj of vertices. Let Sj be the corresponding subset of 
vertices of the original graph G, after we un-contract each super-node vq E Sj, replacing it with 
the corresponding set C of vertices. Let Tj C T be the subset of \k/2] terminals that serve as 
endpoints of the paths in Vj, and let Ej C oMt G / (S'j ) be the subset of \k/2] edges where these 
paths terminate. From Claim [3j set Sj is aw (k* )/3- well-linked. Therefore, every pair (e, e 7 ) E Fj 
of edges can simultaneously send to each other at least prj^y ' 3pl^(k*) — ir now un ^ s with no 
congestion in G[Sj\. (We have used Equation [I]) . Denote this flow by Fj. From Claim [4j there is 
a set Vj : Tj ^3 Ej of paths in graph G. Let T- — T \ Tj. Then |7J 7 | < |7J| < k/2. Since graph 

G is 1/3- well-linked for T, there is a set : TJ ^3 Tj of paths in graph G. We now define a 
flow Fj, as follows: each terminal t E T- sends 1/r flow units to some terminal in Tj, along the 

path in V* that originates at t. Next, each terminal t' E Tj sends 2 ~ X J k flow units to some edge in 

Ej, using the path in Vj that originates at t f . Each edge in Ej now receives 2 ~^ fe flow units, and 
uses the flow Fj to spread this flow evenly among the edges of Ej . This defines the flow F* , where 
every pair (t, t') of terminals sends ^ flow units to each other. The congestion of the flow F* is 
computed as follows: the congestion due to flow on paths in V* is at most 3/r; the congestion due 
to flow on paths in Vj is at most 6/r, and the congestion due to the flow Fj is at most 1. Notice 
that flow Fj is entirely contained inside G[Sj\. 

The final flow F* is simply the union of flows Fj for 1 < j < r. Clearly, in F*, every pair of 
terminals sends \jk flow units to each other. It is easy to see that the flow congestion is bounded 
by 10. 

Type-2 Witnesses Assume now that we are given a type-2 witness A, and let A C V(G) \ T 
be the subset of vertices obtained from A, after we replace each super-node vq with the set C of 
vertices. From Claim [3J set A is ^awir \k/4] )-well-linked for the subset E C out^A) of edges. 

Therefore, every pair (e, e') E E of edges can send r .^^ • 3^q7^/41) > ^ flow units to each 
other with no congestion in graph G. (We have used Equation [j] and the fact that k* > r • \k/4~}). 
Let F denote this flow. Recall that for each 1 < j < r, we have a collection Vj : T* ^2 £j of paths 

in graph G '. From Claim |ij there is a set Vj : T* ^6 £j of paths in graph G. Finally, partition 
T\ T* into three subsets, 71,72,73 of size at most |~ft/4] each. Since graph G is 1/3- well-linked, 
for each 1 < i < 3, there is a set : % ^3 T* of paths in G. Let Q denote the following set of 
paths: start with Q\ U Q2 U Q3, and add, for each terminal t E T*, an empty path connecting 
t to itself. Then set Q contains, for each terminal t E T, a path Qt, connecting t to some terminal 
^ guc ] 1 that for each terminal E 7~*, there are exactly four terminals in T whose path Qt 
terminates at t r . Notice that the paths in Q cause congestion at most 9 in G. We are now ready 
to define our final flow F*. First, every terminal in T sends one flow unit to some terminal in T*, 
along the path Q t E Q. Next, for each 1 < j < r, each terminal t E T*, sends ^ flow units along 
the path in Vj that originates at t. Notice that each edge in E now receives £ flow units. Finally, 
we use the flow F, to spread the flow that every edge receives evenly among the edges in E, so 
every pair of edges in E needs to send ^ • r < flow units to each other. This finishes the 
definition of the flow F*. Clearly, every pair of terminals sends 1/k flow units to each other. We 
now analyze the congestion due to this flow. The paths in Q cause congestion 9, and the paths in 
V\, . . . , V r cause congestion at most 24 altogether (each set Vj of paths originally caused congestion 
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6, and we send 4/r flow units along each path in Vj). Finally, flow F causes congestion at most 1. 
Altogether, flow F* causes congestion at most 34. □ 
The next theorem provides an algorithm that, given any legal contracted graph G", either finds 
a contractible subset of vertices in G' , or finds a witness of type 1 or 2 in G' . 

Theorem 8 Let G f be any legal contracted graph, and assume that \ V(G f )\T\ > F(k). Then there 
is an efficient algorithm that finds either a contractible subset S f of vertices, or a type-1 witness J 7 , 
or a type- 2 witness A in graph G' . 

Proof: Since we only work with graph G' in this proof, we omit the sub-script G' in our 
notation, and use out(S') to denote out ( ^/(S'). Let S C V{G') \ T be any subset of non-terminal 
vertices. We say that a partition (X, Y) of S is balanced, iff |X|, \Y\ > \S\/A. We start with the 
following lemma. 

Lemma 1 Let S C V(G f ) \ T be any subset of non-terminal vertices with \S\ > 2 9 • F(k/2). Then 
there is an efficient algorithms that either finds a type-2 witness A, or a contractible set S' of 
vertices in G' , or a balanced partition (X, Y) of S with \E(X,Y)\ < rk. 

Proof: Let (X, Y) be any balanced partition of S, and assume w.l.o.g. that |X| > |y|. If 
\E{X, Y)\ < rk, then we stop and output the partition (X, Y). Otherwise, we perform a number of 
iterations. In each iteration, we are given as input a balanced partition (X, Y) of S with |X| > \Y\ 
and \E(X, Y)\ > rk, and we try to establish whether X is a type-2 witness. If this is not the case, 
then we will either find a contractible subset S f of vertices in G 1 or we will produce a new balanced 
partition (X',Y') of S, with \E(X' ,Y')\ < \E(X,Y)\. Therefore, after at most \E(G')\ steps, we are 
guaranteed to find a type-2 witness A, or a contractible set S f of vertices, or a balanced partition 
(X,y) of S with \E(X,Y)\ <rk. 

We now proceed to describe each iteration. Suppose we are given a balanced partition (X, Y) of 
S with |X| > |y| and \E(X, Y)\ > rk. Throughout the iteration execution, we denote V = E(X, Y). 
An iteration consists of three steps. In the first step, we try to find a collection V\ of |~fc/4] edge- 
disjoint paths in graph G f connecting |~fc/4] distinct terminals in T to a subset E\ of [fc/4] edges in 
T. In the second step, we identify additional (r — 1) subsets E2, . . .,E r of edges of V of size [fc/4] 
each, and try to find, for each 1 < j < r, a collection Vj of paths connecting terminals in T to 
the edges in Ej with congestion at most 2. Finally, in the third step, we set E = Uj=i Ej, and we 
try to establish whether X is aw{r • [fc/4])- well-linked for E. If all three steps succeed, then we 
output X as a type-2 witness. If any of the three steps fails, then we will either find a contractible 
set S' of vertices in G", or a new balanced partition (X',Y f ) of S with \E(X',Y')\ < \E(X,Y)\. 
In the latter case, we continue to the next iteration with the new partition (X f ,Y f ) replacing the 
partition (X, Y). We now turn to describe each of the three steps. 

Step 1 In this step we try to find a set V\ of edge-disjoint paths in graph G' connecting |~fc/4] 
distinct terminals in T to the edges of T. In order to do so, we set up the following flow network 
N. We sub-divide each edge e E T by a vertex z e , and set T' = {z e \ e E T}. We then contract the 
vertices of T into a source s, and the vertices of T' into a sink t. Assume first that there is an s-t 
flow of value at least \k/2] in the resulting network N. This flow defines a collection V' of \k/2] 
paths, where each path connects a distinct terminal in T to some edge in V (since each terminal in 
T has exactly one adjacent edge in G'). These paths are completely edge-disjoint, except that each 
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edge in T may serve as an endpoint of up to two such paths. We select a subset V\ Q V 1 of \k/A\ 
paths, such that each edge in T now participates in at most one path in that is, the paths in 
V\ are edge-disjoint. 

Assume now that the value of the maximum s-t flow in N is less than \k/2~\ . We show that in 
this case, we can either find a contractible set S' of vertices, or a balanced partition (X y \Y r ) of S 
with \E(X' ,Y')\ < \k/2] < rk. Since the value of the maximum s-t flow in N is less than [fc/2], 
there is an s-t cut (A',B f ) with s E A', t E B' \ and \E(A',B')\ < \k/2~\ in N. Let A = A f \ {s} 
and B = B' \ {t}. Then (A, B) is a partition of V(G') \ T. Denote X A = X n A, X B = X n B, 
Ya = Y D A, and Yb = Y fl B. Let T / C T be the subset of edges e = (u, v) where either u E A, 
v E or both u,v £ A (see Figure [2]). Notice that every edge in T f contributes at least 1 to the 
cut E N {A , 1 B , ) 1 and since Ec f (A, B) C B) U T 7 , we get that \E G >(A,B)\ < \k/2\. 



Assume first that \Xa\ > \Xb\- In this case, we define a new partition (X f ,Y f ) of 5, where 
X' = Xa and Y f = Y U Xb- It is immediate to see that (X f ,Y f ) is a balanced cut, since \X'\ > 
\X\/2 > 151/4. In order to bound E{X',Y') 1 observe that 

E GI {X'X) = E GI {X A ,X B )UE GI {X A ,Y) C ^(lA^sjur. 
Therefore, |£7 G '(X , ,y / )l < \E G >(X A ,X B )\ + |r 7 | < < kr. 

From now on we assume that \X B \ > \Xa\, so \X b \ > |5|/4 > 2 7 F(fc/2). Let Ci be the set 
of all connected components of G'[B\. If for any component C E Ci, \C\ > 2 7 F(/c/2), then we 
stop the algorithm, and output C as a contractible set. Indeed, \C\ > 2 7 F(k/2) 1 while |out(C)| < 
\E' G (A,B)\ < \k/2]. We now assume that for all components C E Ci, |C| < 2 7 F(k/2) < \S\/4. 

Let C2 be the set of all connected components of G'[Xb U Yb]- Notice that each connected 
component C E C2 must be contained in some connected component C E Ci, so |C| < |*5|/4 must 
hold. We construct a new partition (X 7 , Y 7 ) of 5, as follows. Start with X 7 = 0, and add components 
C E C2 to X 7 one- by-one, until |X 7 | > \S\/A holds. Since the size of each such component is less 



than |5|/4, while \X B \ > |5|/4, in the end, \S\/4 < \X f \ < \S\/2. Let Y f = S\X f . Then (X f , Y f ) is 
a balanced partition of S, and E G >(X f , Y f )\ C E G /(A, £>), so ^^(X 7 , Y 7 )l < |#G"(A B )I < f*/ 2 ! < 



Step 2 From now on, we assume that we have successfully found a set V\ of \k/A\ edge-disjoint 
paths connecting a subset T* C T of [fc/4] terminals to the edges in T. Let Ti be the subset of 




Figure 2: Illustration for Lemma [1} Edges in are shown in red. 



kr. 



[fe/4] edges of T that serve as endpoints of these paths, so V\ : T* 



1:1 _ 

-^1 1 1. 
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We select arbitrary (r — 1) disjoint subsets T2, . . . , T r of T \ Ti , containing [fc/4] edges of each. 
For each 2 < j < Tj, we will try to find a collection of edge-disjoint paths, connecting the edges 
of Ti to the edges of Tj. We will show that if such set of paths cannot be found, then we can 
find another balanced partition (X' \Y f ) of S with \E G r(X', Y f )\ < \E G >{X,Y)\. For simplicity, we 
provide and analyze the procedure for j ' = 2, and the procedure is similar for all 2 < j < r. 

We set up the following flow network. Start with the graph G'[X] U T\ U Y2- Let V\ be the 
set of the endpoints of edges of Ti that do not belong to X, V\ = {v \ (v,u) E Ti,v X}, and we 
define V2 similarly for T2. We then unify all vertices of V\ into a source s, and all vertices of V2 
into a sink t. Let N' be the resulting network. Assume first that there is an s-t flow in N' of value 
|~&/4~|. Then this flow defines a collection T r 2 of |~fc/4] edge-disjoint paths, connecting the edges of 
Ti to the edges of Y2- Concatenating the paths in V\ with the paths in we obtain a collection 
V 2 : T* ^2 T 2 of paths in G' . 

Assume now that such flow does not exist. Then there is an s-t cut (A, B) in TV', with s E A, 
t E B, and B)| < [fe/4]. We partition the edges of Ti into two subsets: set Ta denotes the 

edges that do not belong to the cut Ew(A, B) (that is, for each edge e = (5, v) E Ta, uGi), and 
set Tg denotes edges that belong to the cut (for each edge e = (s,v) E Tg, 7; E S). Similarly, we 
partition the set T2 of edges as follows: set T' A contains all edges that belong to the cut En>(A, B), 
and T' B contains all edges that do not belong to the cut. The set Y \ (Y\ U Y2) of edges is also 
partitioned into two subsets: Ta denotes all edges (u,v) E T with u E Y and v E A, and T# 
denotes all edges (u, v) E T with u eY and v E B. Finally, let E' = Eq'{A, B) (See Figure [3]). 




Figure 3: Illustration for Lemma [T] 

The set of edges that belong to the cut En>(A, B) is E' U Tg U T^, and the value of this cut 
is less than [fc/4]. In particular, since \T A UT B \ = \k/A\ and \T' A UT f B \ = [fc/4], it follows that 
l^l < |Ta|, and l^l < \T' B \. Assume first that \A\ < \B\. We then define a new partition {X 1 \Y f ) 
of S, where X' = B and Y' — Y U A. It is easy to see that (X', Y') is a balanced cut. Notice that 
E G *(X,Y) =T A UT f A UT B UT' B ur A ur Bl while E(X f ,Y f ) = T B UT f B UT B UE f . In order to show 
that \E{X',Y')\ < \E(X,Y)\, it is enough to prove that \E'\ < \T A \ + \T' A \, which follows from the 
fact that \E'\ < \T A \. 

Otherwise, if \A\ > \B\, we define a new partition (X 7 , Y f ) of S where X 1 — A and Y f — Y U B. 
Again, it is easy to see that (X 7 , Y f ) is a balanced cut. Notice that E(X, Y) = T A U T' A U T B U T' B U 
TaUT b , while E(X'X) = T A UT' A ur A UE f . In order to show that \E(X' ,Y')\ < \E(X,Y)\, it 
is enough to prove that < \T B \ + |T^|, which follows from the fact that \E f \ < \T A \. 

We say that steps 1 and 2 are successful iff we have found r disjoint subsets Ti, . . . ,T r of T 
containing \k/4] edges each, and for each 1 < j < r, we have found a set Vj of [fc/4] edge-disjoint 
paths, Vj : T* ^2 Tj. We assume from now on that steps 1 and 2 have been successful. We now 
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proceed to describe step 3. 

Step 3 Let T f = \Sj=\^j- In this step, we try to verify that X is aw(r • [fc/4])-well- linked for 
I". If this is not the case, then we return a balanced partition (X f ,Y f ) of S with \E G /(X r , Y r )\ < 
\E G /(X,Y)\. We set up an instance of the sparsest cut problem, as follows. Start with the graph 
G f and sub-divide every edge e E V r by a vertex v e . Let T r = {v e \ e E r 7 }, and let G 77 be the sub- 
graph of the resulting graph induced by X U T 7 . We run algorithm *4arv on the instance (G 77 , T 7 ) 
of the sparsest cut problem. Let A,B be the resulting partition of X, and assume w.l.o.g. that 
\A\ < \B\. Denote T A = out G t(A) fl r 7 and Tg = out G /(B) fl r 7 . Assume first that \E G >{A,B)\ < 
min{|T A |, \T B \}. We then define a new partition (X , 1 Y / ) of 5, where X f = B and Y f = A U Y\ 
It is easy to see that (X',Y f ) is a balanced partition, since \B\ > \A\. Moreover, \E G /(X', Y')\ < 
\E G ,(X,Y)\ - \T A \ + \Eg>(A,B)\ < \E G ,(X,Y)\ as required. 

Assume now that \E G f(A, B)\ > min{|T^|, |Tg|}. Then we are guaranteed that X is (l/a ARY (r- 
[fc/4])) > aw(r- [fc/4])- well-linked for r 7 . We then declare that X is a type-2 witness and terminate 
the algorithm. Indeed, we have established that X is aw(r ■ [fc/4])- well-linked for T 7 , and we have 
found, for each 1 < j < r, a collection Vj : T* ^2 Tj of paths in G 1 . □ 

We are now ready to complete the proof of Theorem [8j The algorithm consists of two phases. 
In the first phase we have |~logr] iterations. In each iteration z, we start with a family S{ of 2 l ~ 1 
disjoint subsets of vertices of V{G r ) \ T, where for each S E Si \S\ > 2 9 F(/c/2), and produce a 
family of 2 l subsets, that become an input to the next iteration. In the input to the first 
iteration, S\ — {V{G f ) \T}. Iteration i is executed as follows. Consider some set S E Si. We 
apply the algorithm from Lemma [l] to set S. If the output is a type-2 witness A, or a contractible 
set S' of vertices, we stop the algorithm and output this set. Otherwise, we obtain a balanced 
partition (X,Y) of 5, with \E(X,Y)\ < rk. In this case, we add X and Y to <%+i. Notice that 
I out(X)|, I out(y)| < I out +rk. We let Si+i be the set obtained after we process all sets S E Si. 
Observe that since we find balanced cuts in each iteration, for each 1 < i < [logr] +1, for each 
S E \S\ > lV fjy i > -^Ij > ^ > 2 9 • F(fc/2), and so we can indeed apply Lemma[l|to all 
sets in Si. 

Consider now the output of the last iteration <Spi ogr ] , and let Si, . . . , S r be any r sets in <S[i gr]+i- 
Fix some j : 1 < j < r, and consider the set Sj. From the above discussion, Sj C V{G f ) \ T. 
Moreover, since | out(V(G f ) \ T)| = fc, and in each iteration, if we start with a set S and produce 
a partition (X, Y) of 5, then | out(X)|, | out(y)| < | out (5)1 + rfc, we get that | out(5j)| < k + rk • 
[logr] < 2/crlogr = k*. Moreover, as observed above, \Sj\ > > 2 12 rlogr • F{k/2). 

Let Wj be the weak well-linked decomposition of Sj, given by Theorem |3j Notice that from 
the definition of well-linkedness, for every cluster C E Wj, G'[C] is connected. If any set R f E Wj, 
with I out (i? 7 ) I < \k/2~\ is contractible, then we simply output R 1 as a contractible set. From 
now on assume that all sets in Wj are non-contractible. Notice that for each R' E Wj, set R' is 
aw(^*)-well-linked. Let S f - E Wj be the set of maximum cardinality. We need the following claim. 

Claim 5 |5 7 | > 2 7 • F(k/2). 

Proof: Recall that from Theorem [3l 52 R , eW . \ out(R f )\ < 1.2|out(5j)| < 2.4/crlogr. We 
partition the set Wj into two subsets: W contains all sets R' E Wj with | out(i? 7 )| > fe/2, and W 2 
contains all remaining sets. Further, we partition the set W 2 into subsets Hi, for 2 < i < log k + 1, 
as follows: Hi contains all sets R' with k/2 i < \ out(i? 7 )| < fc/2 i_1 . 
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Fix some 2 < i < logfc + l. Since YlweWj \ ou ^(R f )\ < 2.4/crlogr, we get that \TZ{\ < 3rlogr-2 z , 
and since each set R! E Hi is non-contractible, \R'\ < 2 7 • F(k/2 l ~ 1 ) must hold. We therefore obtain 
the following bound: 

logfc+l 

J2 \ R '\- J2 3 • 2* • r logr • 2 7 • F(k/2 i ~ 1 ) 

logfc+1 

= 3-2 7 -rlogr 2 i F{k/2 i ~ 1 ) 

i=2 

Denote T(i) = 2 i F(k/2 i ~ 1 ). By the recursive definition of F(fc'), T(i) < ^F(k/2 i ~ 2 ) = T(i - 
l)/4. Therefore, the values T{i) form a geometric series, and Y^i=2 +l < 4T(2)/3. We conclude 
that E^gW 2 l^'l < 2 n rlogr -F(k/2) < \Sj\/2, and so E^gW 1 \ r '\ > \ S j\/ 2 ^ 2 n r logr • F(/c/2). 

Finally, observe that set W 1 may contain at most 5r log r sub-sets, since for each subset R' E W 1 , 
| out(R f )\ > fc/2, while X^'ew? lou^-R')! < 2Ark\ogr. Therefore, at least one subset in W 1 

contains more than 2 r ^{ Q ^ k ^ 2 " > > 2 7 • F(k/2) vertices. □ 

Next, we try to route the terminals in T to the edges in out(S^), as follows. We build a flow 
network, starting from the graph G\ contracting all terminals in T into a source 5, and all vertices 
in Sj into a sink t. We try to find an s-t flow in this network of value at least \k/2]. Assume 
first that we are unable to find such flow. Then we can find a minimum s-t cut (A,B), with 
s E A, t E £>, and the cut value is less than \k/2] in this network. Let B f = (B \ {£}) U Sj. 
Then B' C V{G') \ T \ and it is a contractible set, since |out G /(5')l < \ k / 2 ~], while S j ^ B ', so 
\B f \ > 2 7 • F(k/2). Moreover, G'[B'] is a connected graph, since G'[Sj\ is connected. 

Assume now that we have managed to find a flow of value at least \k/2] in this network. 
Then this defines a collection Vj of \k/2~\ edge-disjoint paths, connecting distinct terminals in T 
to distinct edges of out(S^). 

Overall, our algorithm may terminate early, in which case it is guaranteed to produce either a 
type-2 witness A, or a contractible set. Otherwise, the algorithm finds a family T — {S[, . . . , S f r } 
of vertex subsets that are aw(fc*)-well-rinked, with the collections V[, . . . ,V' r of paths as required, 
thus giving a type-1 witness. □ 

We are now ready to complete the proof of Theorem |6j If the set R — V(G) \ T is a good 
router, then we let C = {it!}, and the sparsifier H is the corresponding contracted graph (a star 
graph, where the star center is vr, and the leaves are the terminals). Assume now that R is not a 
good router. We then start with G' — G, and repeatedly apply Theorem [8] to G / . From Theorem [7J 
graph G' cannot contain type-1 or type-2 witnesses, so the output of the theorem will always be 
a contractible set S of vertices in G r . We then apply Procedure Contract(G / , S) to obtain a new 
contracted graph G", with ^(G")! < and continue. We are guaranteed to obtain, after at 

most |^(G)| iterations, a legal contracted graph G r with \V{G') \ T\ < F(k), which we output as 
the final sparsifier. From Claim [l] G f is indeed a quality- (2?7*)-sparsifler. 

In order to bound the running time of the algorithm, we prove that for any n- vertex graph with 
a set T of k terminals, the running time of the algorithm is T(n, k) = n°' log ^ • 2 k . The proof is 
by induction on the values of k. For k < 4, the running time of the algorithm is poly(n). Assume 
that the claim holds for all values k' < k, and we now prove it for k. Recall that our algorithm 
performs at most n iterations. Each iteration involves a call to procedure Contract(G / , 5), and 
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takes an additional time of poly(n). Let k r — | out (5)1, and recall that k' < \k/2\. Procedure 
Contract(G / , S) computes a strong well-linked decomposition S of the set S, and then computes a 
sparsifier for each set Z E S recursively. For each set Z E S, let kz = outc(Z), and let rtz — \Z\. 
Then kz < k' < [fc/2], and J2zeS n z < n - Therefore, by the induction hypothesis, the running 
time of the recursive procedure for each set Z E S is at most T(nz,kz) < T(nz, \k/2\), and 
the total running time of procedure Contract(G / , S) is at most 2 fc poly(n) + ^2zesT( n z, \k/2\) < 
2 k poly(n)+T(n, \k/2] ). Overall, the running time of the algorithm is then bounded by n-(poly(n) + 
2 fc poly(n) + T(n, \k/2])) < 2 k poly(n) + n • ( n 0(lo ^ k W) • 2 k ) < n °( lo ^ k ) . 2 fc = T(n, fc). 
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A List of Parameters 



«ARv(/c) 


O(Vlogk) 


Approximation factor of algorithm v4arv for sparsest cut 


a w (z) 


^(log 3 / 2 z) 


Well-linkedness parameter for the 
weak well-linked decomposition 


/3fcgO) 


O(logfc) 


Flow-cut gap for undirected graphs 


V* 


34 


Parameter from the definition of good routers 


r 


0(log 3 k) 


Number of vertex subsets in a type-1 witness 


k* 


2kr log r — k poly log k 
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B Proofs Omitted from Section [2] 



B.l Proof of Theorem M 

We use the o: A Rv(^)-approximation algorithm *4arv for the sparsest cut problem. We set clw( z ) — 

27a A RV L Wlogz = ^ ( ' 

Throughout the algorithm, we maintain a partition W of the input set 5 of vertices, where for 
each i? E W, | out(i?)| < | out (5)1. At the beginning, W consists of the subsets of 5 defined by the 
connected components of G[S]. 

Let R E W be any set in the current partition, and let (Gr, Tr) be the instance of the sparsest 
cut problem corresponding to R, as defined in Section [2j We say that a cut (A\ B f ) in Gr is sparse, 
iff its sparsity is less than l/(2 7 logz). We apply the algorithm *4arv to the instance (Gr,Tr) of 
sparsest cut. If the algorithm returns a cut (A f , £>'), that is a sparse cut, then let A = A' \ Tr, and 
B = B' ' \Tr. We remove R from W, and add A and B to it instead. Let T A = out(R) D out (A), 
andT B = out (12) flout (B), and assume w.l.o.g. that \T A \ < \T B \. Then \E(A,B)\ < |T A |/(2 7 logz) 
must hold, and in particular, |out(A)| < |out(£>)| < |out(i?)| < |out(*S)|. For accounting purposes, 
each edge in set T A is charged l/(2 7 logz) for the edges in E(A,B). Notice that the total charge 
to the edges in T A is |T A |/(2 7 logz) > \E(A,B)\. Notice also that since \T A \ < |out(E)|/2 and 
\E(A,B)\ < \T A \/2\ | out (A) | < 0.51 1 out (R)\. 

The algorithm stops when for each set R E W, the procedure ^Iarv returns a cut that is not 
sparse. We argue that this means that each set R E W is aw(z)- well-linked. Assume otherwise, 
and let R E W be a set that is not a^(z)-well-linked. Then, by the definition of well-linkedness, the 
corresponding instance of the sparsest cut problem must have a cut of sparsity less than olw{z) — 
l/(2 7 a ARV (z) logz). The algorithm Aabv should then have returned a cut whose sparsity is less 
than olw{z) • <^arv(^) — l/(2 7 logz), that is a sparse cut. 

Finally, we need to bound J^ReW I ou t(^) I • We use the charging scheme defined above. Consider 
some iteration where we partition the set R into two subsets A and £>, with \T A \ < \Tb\. Recall 
that each edge in T A is charged l/(2 7 logz) in this iteration, while | out(A)| < 0.51 1 out(i?)| holds. 
Consider some edge e = (u, v). Whenever e is charged via the vertex u, the size of the set out(i?), 
where u E R E W goes down by a factor of at least 0.51. Therefore, e can be charged at most 2 log z 
times via each of its endpoints. The total charge to e is then at most 41ogz/(2 7 logz) = 1/2 5 . This 
however only accounts for the direct charge. For example, some edge e f out (5), that was first 
charged to the edges in out (5), can in turn be charged for some other edges. We call such charging 
indirect. If we sum up the indirect charge for every edge e E out (5), we obtain a geometric series, 
and so the total direct and indirect amount charged to every edge e E out (5) is at most 1/2 4 < 0.1. 
Therefore, J^ReW I ou t(^)| < 1 - 2 1 out(*S)| (we need to count each edge e E (Ui^W out (it!)) \out(S f ) 
twice: once for each its endpoint). 

B.2 Proof of Theorem H 

The algorithm is very similar to the algorithm used in the proof of Theorem |3j but the analysis is 
different. We start by describing the algorithm. 

Throughout the algorithm, we maintain a partition S of the input set S of vertices, where for 
each ReS,\ ont(R)\ < | out(S% At the beginning, S = {S}. 

Let R E S be any set in the current partition, and let (Gr,Tr) be the corresponding instance 
of the sparsest cut problem. We say that a cut {A' , B f ) in Gr is sparse, iff its sparsity is less than 
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1/3. Notice that the set R is 1/3- well-linked iff there is no sparse cut in R 

Our algorithm proceeds as follows. Whenever there is a set R E <S, such that the sparsity of the 
sparsest cut (A\ B r ) in the corresponding instance (Gr,Tr) is less than 1/3, we set A = A' \ Tr 
and B = B' \ Tr. We then remove R from <S, and add A and B to it instead. Since | out(i?)| < 
| out (5) | < z, the sparsest cut problem instance can be solved in time 2 z poly(n): we simply go 
over all bi-partitions (7i, 75) of the set Tr of terminals, and for each such bi-partition, compute the 
minimum cut separating the vertices in 7i from the vertices in 75- The algorithm terminates when 
for every set R E 5, the value of the sparsest cut in the corresponding instance is at least 1/3. From 
the above discussion, the running time of the algorithm is 2 Z poly(n). It is also clear that when the 
algorithm terminates, for every set R E <S, |out(i?)| < | out (5)1 , and R is 1/3- well-linked. It now 
only remains to analyze the sizes of the collections Si of vertex subsets in the resulting partition, 
and to bound J^ReS I out(i?)|. 

Let C be the set of all cuts that we produce throughout this algorithm (that is, C contains all sets 
R that belonged to S at any stage of the algorithm), and let T be the corresponding partitioning 
tree, whose vertices represent the sets in C, and every inner vertex vr has exactly two children 
va,vb, where (A,B) is the partition that our procedure computed for the set R. 

For the sake of the analysis of the algorithm, we define a new graph G f , as follows. We start 
with the graph G, and the final partition S of S. Let R,R f €iS be any pair of distinct vertex 
subsets, and let e = (u, v) be any edge with u E R,v E R' . We subdivide the edge e by adding two 
vertices u e ,v e to it, so that now we have a path (u,u e ,v e ,v) instead of the edge e. Additionally, 
for every edge e = (u, v) with u E 5, v £ S, we subdivide edge e by adding a new vertex u e to it. 

All the newly added vertices are called terminals and the set of all such terminals is denoted 
by T. The resulting graph is denoted by G' . For each subset XCSof vertices, we will still denote 
by out(X) = outc(X). Consider now some subset R E C of vertices. We now define the subset 
T(R) C T of terminals associated with i?, as follows: T(R) = {u e \ e = (u, v) E out (R),u E R}, so 

\T(R)\ = | out(i2)|. Moreover, ^ReS I out (^)l = Znes \ T ( R )\ = \ T l 

We define a charging scheme that will help us bound the number of terminals, and the number 
of sets in each collection Si. 

We say that a set R E C belongs to level i iff < \ out(i?)| < 2( -_ z 1)/2 . In particular, S is a 
level-1 set, and we have at most 21ogz + 1 levels. We also partition all terminals into levels, and 
within each level, we have two types of terminals: regular and special. Intuitively, special terminals 
at level i are the terminals that have been created by partitioning some sets from levels 1, . . . , i — 1, 
while regular terminals are created by partitioning sets that belong to level i. 

The terminals in set T(S) are called special terminals, and they belong to level 1. Let R be 
some level-z set, and let A, B be its children, with | out(R) D out(A)| < | out(R) fl out(£>)|. Then 
\E(A,B)\ < |out(i2)nout(A)|/3 = \T(R) n r(A)|/3, and so \T(A) \ = \T(R) n r(A)| + \E(A, B)\ < 
\T(R)\(^ + \) < \T(R)\/y/2. Let i' and i" be the levels to which sets A and B belong, respectively. 
Then i' > i + 1 must hold. We call all terminals in set T(A) \ T(R) special terminals for level i' (to 
indicate that they were created from partitioning a lower-level set). If i" > i + 1 holds as well, then 
we call all terminals in set T(B) \ T(R) special terminals at level i" . Otherwise, they are regular 
terminals, that belong to level i" = i. 

Notice that both sets T(A) \ T(R) and T(B) \ T(R) of new terminals come from subdivisions of 
the edges in E(A,B), so each such edge gives rise to one terminal in T(A) and one in T(B). We 
stress that for each i, a level-i terminal continues to be a level-z terminal throughout the algorithm, 
even if its corresponding subset of vertices in S becomes further subdivided and stops being a level-i 
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set. So for example, if R is a level-i set, the terminals in T(R) may belong to levels 1, . . . , i. The 
next lemma bounds the number of terminals at each level, and it is central to the analysis of the 
algorithm. 

Lemma 2 For each 1 < i < 21ogz + 1, there are at most 2 l ~ 2 z special terminals, and at most 
2 l ~ 1 z regular level-i terminals. For i = 1, there are z level-1 special terminals, and at most z/2 
regular level-1 terminals. 

Proof: For each level i, let U{ be the number of special terminals, and n! { the number of regular 
terminals. Let S\ be the total number of terminals at levels 1, . . . ,i. We use the following two 
simple claims. 

Claim 6 For each i > 1, rii < S^_i/3. 

Proof: Recall that a level-i special terminal can only be created when partitioning some cluster 
R E C that belongs to levels 1, . . . , i — 1. Suppose that the partition of R is (A, £>), and assume that 
\T(A) nT(R)\ < \T(B)nT(R)\. Let X E {A, B} be the cluster that belongs to level-i (it is possible 
that both A and B belong to level i - the analysis for this case is similar and is carried out for each 
one of the clusters separately). Then the terminals in T{X) fl T(R) belong to levels 1, . . . ,i — 1, 
and the terminals in T(X) \ T(R) become special terminals at level i. We charge the terminals in 
T(X) n r(i?) for the terminals in T(X) \ where the charge to every terminal in T(X) D T(R) 

is at most 1/3. Moreover, since X now becomes a level-i cluster, the terminals in will never 

be charged for special level-i terminals again. Therefore, the number of special level-i terminals is 
at most Si-i/3. □ 

Claim 7 For every level i, n! i < (S^_i + r^)/2. 

Proof: The proof uses a charging scheme, that is defined as follows. Let V be the set of all 
terminals at levels 1, . . . , i — 1, together with the special terminals at level i. We charge the regular 
level-i terminals to the terminals in T 7 , and show that n! i < |T / |/2. 

Recall that regular level-i terminals are only created by partitioning level-i clusters R E C. Let R 
be any such level-i cluster, that we have partitioned into (A, B). Assume w.l.o.g. that \Ta\ < \Tb\, 
and recall that A must belong to some level i' > i. This partition only creates level-i terminals if B 
belongs to level i. The number of the newly created level-i terminals, \T(B) \ T(R)\ = \E(A, B)\ < 
\T A \/3 = \T(A)r\T(R)\/3. We charge the terminals in \T(A)nT(R) \ for the newly created terminals 
in r(2?) where each terminal in \T(A) nr(i?)| is charged 1/3. Observe that the terminals in 

\T(A) nT(R)\ must belong to levels 1, . . . , i, and moreover, since A does not belong to level i, these 
terminals will never be charged again for level-i terminals. But we may charge them indirectly - by 
charging the terminals in T(B) \ T(R) for some new terminals. However, since a direct charge to 
every terminal is bounded by 1/3, the total direct and indirect charge to any terminal in forms 
a geometrically decreasing sequence, and its sum is bounded by 1/2. Therefore, < |r 7 |/2. □ 

The number of special level-1 terminals is z, and the number of regular level-1 terminals is at 
most z/2 by Claim [7| so Si < 3z/2. In general, we now have that for any i > 1, ni < Si-i/3, 
and n- < (Si-i + n*)/2 < 25i_i/3. Therefore, Si = Si-i + rti + n • < 25f_i. We conclude that 
Si < 3 • 2 i ~ 2 z for all i, and so m < 2 i ~ 2 z, and n- < 2 l ~ Y z for all i > 1. □ 

Consider now the final partition S. We can now bound J2 R(E s |out(i?)| = |T| = l^iog^+il ^ 
3z-2 2l °z z = 0(z 3 ). 
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Let R E 5, and assume that R belongs to level i. Then all terminals of T(R) must belong to 
levels 1, . . . , i. The total number of such terminals is bounded by 3z • 2* -2 , and set R uses at least 
z/2 1 / 2 of them. It follows that the number of level- i sets in S is bounded by 2 3t / 2 . 

Recall that Si contains all sets R E S with z/2 l < | out(i?)| < z/2* -1 , and so Si only contains 
sets R E S that belong to levels 2i or 2i — 1. From the above discussion \S{\ < 2 3z+3 . 



C Proof of Corollary [T] 

Assume first that all edge capacities are integral and bounded by G. For each terminal t E T, let 
Ct be the total capacity of all edges incident on t. We replace every edge e E E with c e parallel 
edges of unit capacity. For each terminal t E T, we sub-divide each edge e incident on £ with a 
vertex u ej and we let St be the set of these new vertices. Let G be the resulting graph, and let 
G f — G\T. We let T r — [j tel -St be the set of terminals for the new graph G' . Then \T'\ — G, 
and each vertex in T f has exactly one edge incident to it. We now apply Theorem [2] to (G 1 \T f ) to 
obtain a sparsifler H 1 '. In our final step, for each t E T, we unify all vertices in the set St in graph 
H' into a single vertex t. Let H be this final sparsifler. Clearly, \V(H)\ = C°^ oglogC \ We now 
show that H is a quality- (27/*) sparsifler for G. 

Let D be any set of demands on T, and let F be the routing of these demands in graph G 
with congestion 77 = rj(G,D). Then the routing F naturally defines a set D r of demands over the 
vertices of T r . For each pair (i? e , i? e /) E T 7 of vertices, the demand D'{v e , v e t) is the total flow on all 
flow-paths in F that start at e and terminate at e 1 '. Flow F also gives a routing of this new set D r 
of demands in graph G' with congestion 77. Since i7 7 is a flow sparsifler for G 7 , there is a routing 
F' of the demands in D r in graph H r with congestion at most 77. This routing induces a routing of 
the set D of demands in graph H with congestion at most 77. 

The other direction is proved similarly: if D is any set of demands in T, and F is the routing 
of these demands in H with congestion 77 = rj(H,D), then F defines a set D' of demands on the 
vertices of T' exactly as before. Flow F then induces a routing of the set D' of demands in graph 
H r with congestion at most 77. Since H' is a quality-(277*) sparsifler for G', there is a routing F r of 
the set D' of demands in graph G' with congestion at most 77 • 277* . Flow F' then induces a routing 
of the set D of demands in graph G with congestion at most 77 • 277*. 

Finally, consider the general case, where the edge capacities c e > 1 are no longer required to be 
integral, and may not be bounded by G. For each edge e E E, if c e > G, then we set c e = C. It is 
easy to see that this transformation does not affect the values r?(G, D) for demand sets £>, since all 
flow in the network must traverse one of the edges incident to the terminals. Next, we define new 
edge capacities, by setting c' e — ^— c e . Let G r be the resulting graph. Notice that for each edge 

e, Zfce < 4 < **- c e + 1 < 

Consider some set D of demands defined over the set T of terminals. Let F be the routing 
of D in graph G 7 , whose congestion is r)(G f ,D). Consider the same flow F in graph G. The 
congestion caused by F on each edge e of G is bounded by < • 2y ^ +e . Therefore, rj(G, D) < 
7](G', D) ■ ?2l±£ for all D. 

Similarly, given any set D of demands, let F be the routing of D in graph G, whose congestion 
is r](G, D). Consider the same flow F in graph G'. The congestion on each edge e in G' is bounded 
by ^ < ^ • 2§5r. Therefore, r}(G',D) < r}[G,D) ■ ^ for all D. We conclude that for all D, 
^ri(G, D)< V (G', D) < ^ V (G, D). 
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Let H' be a quality- (27/*) sparsifier for graph G 7 , and let H be the graph obtained from H' by 
multiplying all edge capacities by the factor of Then for any set D of demands: 



and 



277* 277* 

r,( H , D) = -!~T](H', D) < —!—ri(G', D) < V (G, D); 
e e 



n(H,D) = ^fi,(H',D) 

~ 2ij* e n ' ' 

>^—v(G,D). 
2rj* + e 

We conclude that H is a quality-^* + e)-sparsifier for G, and \V(H) \ = C ( loglogC ). 

D Proof Omitted from Section 14.11 
D.l Proof of Claim M 

For simplicity, we build two new graphs H and H f , as follows. Sub-divide every edge e E with 
a vertex v e in both G and G 7 , and let T 7 — {v e | e E £"}. Let i7 be the sub-graph of the resulting 
graph obtained from G, induced by S U T 7 , and let ii 7 be the sub- graph of the corresponding 
graph obtained from G 7 , induced by S 7 U T 7 . Clearly, H' is a legal contracted graph for i7, and 
from the definition of well-linkedness, H' is a-well-linked for T 7 . We only need to prove that H is 
a/ 3- well-linked for T 7 . For any subset Z of vertices in either H or H f , let Tz = Z H T 7 . 

Assume for contradiction that ii 7 is not a- well-linked for T 7 , and let (X, 7) be the violating 
partition of V(fT), that is, |##(X,y)| < f min{|7x|, |7y|}. We construct a partition (X',y') of 
such that \Ej{f(X\ Y')\ < a-min{|7x'|, |7y'|}, contradicting the fact that H' is a- well-linked 

for 7^. 

Let C C be the collection of all clusters C, with vq £ S f . In order to construct the partition 
(X',Y f ) of VX-ff 7 ), we start with the partition (X,Y) of V(H), and process all clusters C E C# 
one-by-one. For each such cluster, we move all vertices of C either to X or to Y. Once we process 
all clusters in we will obtain obtain a partition (X,y) of V(i?), that will naturally define a 
partition (X',y') oiV(H'). 

Consider some cluster C E C#, and partition the edges in out#(C) into four subsets, Ex,Ey, 
Exy^Eyxi as follows. For each edge e = (u,v) E out#(C), with ^ E C, i? C, if both u,v E X, 
then we add e to £"x; if both u, v E Y, we add e to Ey\ if ^ E X, v E y, then we add e to Bjy, 
and otherwise we add it to Eyx- If \Ex\ + |^xy| < |£V| + I^Vxl, then we move all vertices of C 
to y, and otherwise we move them to X. Let Ec Q Ejj(X,Y) be the subset of the edges in the 
cut (X, y), with both endpoints in C, that is, E c = E H (X n C, Y n C). 

Assume w.l.o.g. that + I^xyI < + I ^Vxl, and so we have moved the vertices of C 
to y. The only new edges that have been added to the cut are the edges of Ex- Since set C is 
1/3- well-linked in H, \Ex\ < 3\Ec\- We charge the edges in Ec for the edges in Ex- The charge 
to every edge of Ec is at most 3, and since the edges of Ec have both endpoints inside the cluster 
C, we will never charge them again. 
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Once we process all super-nodes vq in this fashion, we obtain a partition (X, Y) of V(H), where 
for every cluster C G C^, all vertices of C belong to either lor 7. This cut naturally defines a 
partition {X 1 \Y f ) of the vertices of V(H'), where vc E X 1 iff C C X. From the above discussion, 
\E H <{X r ,Y')\ = \E H {X,Y)\ < 3\E H (X,Y)\. Moreover, since the terminals in V do not belong 
to any cluster C G C^, the partitions of the terminals of T r induced by the cuts (X,Y) in H and 
{X' \Y f ) in H' are identical. Therefore, \Eff/(X f , Y')\ < amin {|7x/|, iTy'll? a contradiction. 

D.2 Proof of Claim g] 

We set up the following two flow networks. For the first flow network, we start with the graph G. 
We add a source s, and connect it with a directed edge to every vertex v E V . For every edge 
e G we sub-divide e by adding a vertex z e to it, and connect z e to the sink t with a directed 
edge. We set the capacity of every edge in this network to be 3t/, except for the edges leaving s 
or entering t, whose capacities are set to 1. Let N\ denote the resulting flow network. In order to 
show the existence of the set V of paths, it is enough to show that the value of the maximum flow 
in network N\ is \T r \. For each edge e E E{N\), we denote by c(e) its capacity, and for each cut 
(X, Y) in the network, we denote by c(X, Y) the total capacity of edges connecting the vertices of 
X to the vertices of Y. 

The second network, N2, is constructed similarly, except that we use the contracted graph G', 
instead of the graph G. Specifically, we start with graph G', add a source s and a sink t. Source s 
connects with a directed edge to every vertex v E T y As before, we sub-divide every edge e E 
with a vertex z e , and connect z e to the sink t. The capacities of all edges in N2 are 77, except for 
the edges that leave the source s or enter the sink t, whose capacities are set to 1. Observe that the 
existence of the set V 1 of paths in graph G r guarantees that there is a flow of value \T r \ in network 
N2- For each edge e E £J(A^), we denote by c'{e) its capacity in N2, and for each cut (X,Y) in 
the network, we denote by c'(X, Y) the total capacity of edges connecting the vertices of X to the 
vertices of Y . 

It is now enough to prove that there is a flow of value \T f \ in network N\. Assume this is not 
the case. Then there is an s-t cut (X,Y) in network TVi, with c(X,Y) < \T'\. We show that there 
is an s-t cut (X f \Y r ) in network A^2, with d(X',Y') < |T ; |, contradicting the existence of the set 
V of paths. 

We consider the super-nodes vq E V(G f ) one-by-one. For each such super-node vc, we move 
all vertices of C either to X or to Y. The final cut, (X, Y) will naturally define an s-t cut (X', Y r ) 
in network A^2, and we will show that its capacity is less than \T'\. 

Consider some super-node vq E V{G r ). Recall that we are guaranteed that T' fl C — 0, so 
outiVi(C) = outc(C). Let E c = E G (C nI,Cfl Y). We partition the edges of outc(C) into four 
subsets, Ex containing edges with both endpoints in X, Ey containing edges with both endpoints 
in y, Exy containing edges e = (u, v) with u E XflC, v E Y\C, and Eyx containing the remaining 
edges. If \Ex \ + \Exy\ < \Ey\ + \Eyx\, then we move all vertices of C to Y, and otherwise we 
move them to X. 

Assume w.l.o.g. that \Ex \ + \Exy\ < \Ey\ + l-^yxl? so we have moved the vertices of C to Y. 
Since cluster C is 1/3- well-linked in graph G, \Ec\ > \Ex\/3. We charge the edges of Ec for the 
edges of Ex, with the charge to every edge of Ec being at most 3. Observe that none of the edges 
in Ec U Ex is incident on the source s or the sink t. 

Let (X,y) be the cut obtained after processing all super-nodes vc E V(G f ). Let Ei(X,Y) C 
E(X, Y) be the subset of edges incident on the source s or on the sink t in the original cut, and let 
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Ei(X, Y) be the subset of edges incident on the source s or on the sink t in the new cut. Since none 
of the clusters C we have considered contained vertices of T, or vertices z e for e G E\ |£?i(X, = 
Let £ 2 PW = and similarly, let ^ 2 (^,^) = f). 

From the above discussion, \E2(X,Y)\ < 3\E2(X,Y)\. Recall that the capacities of all edges in 
E 2 (X, Y) and E2(X, Y) are 37/, while the capacities of edges in Ei(X, Y) are 1, and we have assumed 
that c(X,Y) = |£?i(X,y)| +3r)\E 2 (X,Y)\ < |T'|. 

Cut (X,Y) naturally defines an s-t cut (X',Y f ) in network N2. The capacity of this cut 
in network N 2 is c\X',Y') = \Ei(X,Y)\ + ri\E 2 (X,Y)\ < \E X {X,Y)\ + Zr)\E 2 {X ,Y)\ < |T'|, a 
contradiction. 
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